Overview
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 18897 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 80.0 B |
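The two memory figures above can be cross-checked with simple arithmetic. The sketch below assumes every column stores 8 bytes per row (a 64-bit float, or an 8-byte object pointer for the categorical column); the report itself does not state the dtypes, so treat that as an assumption.

```python
# Figures taken from the Dataset statistics table above.
n_rows = 18897
n_cols = 10

# Assumption: 8 bytes per value in every column (float64 values,
# or 8-byte object pointers for the categorical column).
bytes_per_value = 8

record_size = n_cols * bytes_per_value   # bytes per row
total_bytes = n_rows * record_size       # index overhead is ignored here
total_mib = total_bytes / 2**20          # 1 MiB = 2**20 bytes

print(f"Average record size: {record_size:.1f} B")
print(f"Total size in memory: {total_mib:.1f} MiB")
```

Under that assumption the result agrees with the 80.0 B and 1.4 MiB reported above.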
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
Alerts
| households is highly overall correlated with population and 2 other fields | High correlation |
|---|---|
| latitude is highly overall correlated with longitude | High correlation |
| longitude is highly overall correlated with latitude | High correlation |
| median_house_value is highly overall correlated with median_income | High correlation |
| median_income is highly overall correlated with median_house_value | High correlation |
| population is highly overall correlated with households and 2 other fields | High correlation |
| total_bedrooms is highly overall correlated with households and 2 other fields | High correlation |
| total_rooms is highly overall correlated with households and 2 other fields | High correlation |
Reproduction
| Analysis started | 2025-03-03 01:33:32.495059 |
|---|---|
| Analysis finished | 2025-03-03 01:33:53.771983 |
| Duration | 21.28 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
Variables
longitude
Real number (ℝ)
High correlation 
| Distinct | 838 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -119.59418 |
| Minimum | -124.35 |
| Maximum | -114.31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 18897 |
| Negative (%) | 100.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | -124.35 |
|---|---|
| 5-th percentile | -122.47 |
| Q1 | -121.8 |
| median | -118.53 |
| Q3 | -118.02 |
| 95-th percentile | -117.08 |
| Maximum | -114.31 |
| Range | 10.04 |
| Interquartile range (IQR) | 3.78 |
Descriptive statistics
| Standard deviation | 2.0038145 |
|---|---|
| Coefficient of variation (CV) | -0.016755118 |
| Kurtosis | -1.3280542 |
| Mean | -119.59418 |
| Median Absolute Deviation (MAD) | 1.31 |
| Skewness | -0.28170677 |
| Sum | -2259971.1 |
| Variance | 4.0152725 |
| Monotonicity | Not monotonic |
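Several of the longitude statistics above are simple functions of one another. The following sketch recomputes them from the rounded values printed in the report, so small discrepancies (notably in the sum) are expected.

```python
# Values copied from the longitude tables in this report.
n_rows = 18897
mean, std = -119.59418, 2.0038145
minimum, maximum = -124.35, -114.31
q1, q3 = -121.8, -118.02

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance
approx_sum = mean * n_rows        # Sum, up to rounding of the mean

print(f"range={value_range:.2f} iqr={iqr:.2f}")
print(f"cv={cv:.9f} variance={variance:.7f} sum={approx_sum:.1f}")
```

The CV is negative only because the mean is negative. For a coordinate such as longitude, whose zero point is arbitrary, the CV is probably not a useful measure of spread.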
| Value | Count | Frequency (%) |
| -118.31 | 153 | 0.8% |
| -118.3 | 141 | 0.7% |
| -118.27 | 136 | 0.7% |
| -118.32 | 134 | 0.7% |
| -118.28 | 133 | 0.7% |
| -118.29 | 131 | 0.7% |
| -118.36 | 130 | 0.7% |
| -118.19 | 128 | 0.7% |
| -118.35 | 127 | 0.7% |
| -118.14 | 122 | 0.6% |
| Other values (828) | 17562 | 92.9% |
| Value | Count | Frequency (%) |
| -124.35 | 1 | < 0.1% |
| -124.3 | 2 | < 0.1% |
| -124.27 | 1 | < 0.1% |
| -124.26 | 1 | < 0.1% |
| -124.25 | 1 | < 0.1% |
| -124.23 | 3 | < 0.1% |
| -124.22 | 1 | < 0.1% |
| -124.21 | 3 | < 0.1% |
| -124.19 | 4 | < 0.1% |
| -124.18 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| -114.31 | 1 | < 0.1% |
| -114.49 | 1 | < 0.1% |
| -114.55 | 1 | < 0.1% |
| -114.56 | 1 | < 0.1% |
| -114.57 | 3 | < 0.1% |
| -114.58 | 2 | < 0.1% |
| -114.59 | 1 | < 0.1% |
| -114.6 | 3 | < 0.1% |
| -114.61 | 3 | < 0.1% |
| -114.62 | 1 | < 0.1% |
latitude
Real number (ℝ)
High correlation 
| Distinct | 859 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.66329 |
| Minimum | 32.54 |
| Maximum | 41.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 32.54 |
|---|---|
| 5-th percentile | 32.81 |
| Q1 | 33.94 |
| median | 34.28 |
| Q3 | 37.73 |
| 95-th percentile | 39.04 |
| Maximum | 41.95 |
| Range | 9.41 |
| Interquartile range (IQR) | 3.79 |
Descriptive statistics
| Standard deviation | 2.1497734 |
|---|---|
| Coefficient of variation (CV) | 0.060279728 |
| Kurtosis | -1.1267725 |
| Mean | 35.66329 |
| Median Absolute Deviation (MAD) | 1.35 |
| Skewness | 0.4461914 |
| Sum | 673929.2 |
| Variance | 4.6215259 |
| Monotonicity | Not monotonic |
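The Median Absolute Deviation (MAD) listed above is a robust measure of spread. The sketch below assumes the unscaled definition, the median of the absolute deviations from the median; `median_absolute_deviation` is a hypothetical helper name. It is applied to five illustrative latitudes taken from the quantile table, not to the full column, so it does not reproduce the reported 1.35.

```python
from statistics import median

def median_absolute_deviation(values):
    """Unscaled MAD: median of |x - median(x)| over all values."""
    center = median(values)
    return median([abs(v - center) for v in values])

# Illustrative points only: minimum, Q1, median, Q3 and maximum
# of latitude as printed in the quantile table.
sample = [32.54, 33.94, 34.28, 37.73, 41.95]

mad = median_absolute_deviation(sample)
print(f"median={median(sample)} MAD={mad:.2f}")
```

Unlike the standard deviation, this statistic is barely affected by a few extreme values, which is why it is reported alongside it.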
| Value | Count | Frequency (%) |
| 34.05 | 214 | 1.1% |
| 34.08 | 213 | 1.1% |
| 34.06 | 212 | 1.1% |
| 34.04 | 195 | 1.0% |
| 34.09 | 194 | 1.0% |
| 34.07 | 193 | 1.0% |
| 34.02 | 189 | 1.0% |
| 34.03 | 175 | 0.9% |
| 33.93 | 172 | 0.9% |
| 34.1 | 169 | 0.9% |
| Other values (849) | 16971 | 89.8% |
| Value | Count | Frequency (%) |
| 32.54 | 1 | < 0.1% |
| 32.55 | 2 | < 0.1% |
| 32.56 | 9 | < 0.1% |
| 32.57 | 16 | 0.1% |
| 32.58 | 26 | 0.1% |
| 32.59 | 11 | 0.1% |
| 32.6 | 8 | < 0.1% |
| 32.61 | 11 | 0.1% |
| 32.62 | 11 | 0.1% |
| 32.63 | 18 | 0.1% |
| Value | Count | Frequency (%) |
| 41.95 | 2 | < 0.1% |
| 41.92 | 1 | < 0.1% |
| 41.88 | 1 | < 0.1% |
| 41.86 | 3 | < 0.1% |
| 41.84 | 1 | < 0.1% |
| 41.82 | 1 | < 0.1% |
| 41.81 | 2 | < 0.1% |
| 41.8 | 3 | < 0.1% |
| 41.79 | 1 | < 0.1% |
| 41.78 | 3 | < 0.1% |
housing_median_age
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.332698 |
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 19 |
| median | 30 |
| Q3 | 38 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 12.390898 |
|---|---|
| Coefficient of variation (CV) | 0.42242614 |
| Kurtosis | -0.7921755 |
| Mean | 29.332698 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.025443754 |
| Sum | 554300 |
| Variance | 153.53436 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 52 | 1214 | 6.4% |
| 36 | 836 | 4.4% |
| 35 | 803 | 4.2% |
| 16 | 702 | 3.7% |
| 34 | 666 | 3.5% |
| 17 | 618 | 3.3% |
| 33 | 591 | 3.1% |
| 26 | 576 | 3.0% |
| 32 | 538 | 2.8% |
| 37 | 517 | 2.7% |
| Other values (42) | 11836 | 62.6% |
| Value | Count | Frequency (%) |
| 1 | 4 | < 0.1% |
| 2 | 45 | 0.2% |
| 3 | 38 | 0.2% |
| 4 | 124 | 0.7% |
| 5 | 183 | 1.0% |
| 6 | 118 | 0.6% |
| 7 | 120 | 0.6% |
| 8 | 166 | 0.9% |
| 9 | 175 | 0.9% |
| 10 | 229 | 1.2% |
| Value | Count | Frequency (%) |
| 52 | 1214 | 6.4% |
| 51 | 47 | 0.2% |
| 50 | 130 | 0.7% |
| 49 | 131 | 0.7% |
| 48 | 169 | 0.9% |
| 47 | 193 | 1.0% |
| 46 | 237 | 1.3% |
| 45 | 281 | 1.5% |
| 44 | 344 | 1.8% |
| 43 | 344 | 1.8% |
total_rooms
Real number (ℝ)
High correlation 
| Distinct | 4980 |
|---|---|
| Distinct (%) | 26.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2265.1829 |
| Minimum | 2 |
| Maximum | 8874 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 604.8 |
| Q1 | 1408 |
| median | 2037 |
| Q3 | 2898 |
| 95-th percentile | 4658.2 |
| Maximum | 8874 |
| Range | 8872 |
| Interquartile range (IQR) | 1490 |
Descriptive statistics
| Standard deviation | 1249.5242 |
|---|---|
| Coefficient of variation (CV) | 0.55162179 |
| Kurtosis | 1.5942679 |
| Mean | 2265.1829 |
| Median Absolute Deviation (MAD) | 719 |
| Skewness | 1.0658858 |
| Sum | 42805161 |
| Variance | 1561310.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1527 | 18 | 0.1% |
| 1613 | 17 | 0.1% |
| 1582 | 16 | 0.1% |
| 2127 | 16 | 0.1% |
| 1722 | 15 | 0.1% |
| 1717 | 15 | 0.1% |
| 1471 | 15 | 0.1% |
| 1703 | 15 | 0.1% |
| 1607 | 15 | 0.1% |
| 2053 | 14 | 0.1% |
| Other values (4970) | 18741 | 99.2% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 18 | 4 | < 0.1% |
| 19 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 21 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8874 | 1 | < 0.1% |
| 8806 | 1 | < 0.1% |
| 8803 | 1 | < 0.1% |
| 8259 | 1 | < 0.1% |
| 8254 | 1 | < 0.1% |
| 8206 | 1 | < 0.1% |
| 8146 | 1 | < 0.1% |
| 8072 | 1 | < 0.1% |
| 8020 | 1 | < 0.1% |
| 8005 | 1 | < 0.1% |
total_bedrooms
Real number (ℝ)
High correlation 
| Distinct | 1269 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 464.13404 |
| Minimum | 2 |
| Maximum | 1444 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 134 |
| Q1 | 290 |
| median | 419 |
| Q3 | 600 |
| 95-th percentile | 949 |
| Maximum | 1444 |
| Range | 1442 |
| Interquartile range (IQR) | 310 |
Descriptive statistics
| Standard deviation | 244.20534 |
|---|---|
| Coefficient of variation (CV) | 0.52615262 |
| Kurtosis | 0.54127019 |
| Mean | 464.13404 |
| Median Absolute Deviation (MAD) | 147 |
| Skewness | 0.82971245 |
| Sum | 8770741 |
| Variance | 59636.25 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 280 | 53 | 0.3% |
| 345 | 49 | 0.3% |
| 393 | 49 | 0.3% |
| 331 | 49 | 0.3% |
| 343 | 48 | 0.3% |
| 348 | 48 | 0.3% |
| 309 | 47 | 0.2% |
| 394 | 47 | 0.2% |
| 328 | 47 | 0.2% |
| 272 | 47 | 0.2% |
| Other values (1259) | 18413 | 97.4% |
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 6 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 6 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 7 | < 0.1% |
| 10 | 8 | < 0.1% |
| 11 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 1444 | 1 | < 0.1% |
| 1438 | 1 | < 0.1% |
| 1432 | 1 | < 0.1% |
| 1424 | 1 | < 0.1% |
| 1423 | 1 | < 0.1% |
| 1410 | 1 | < 0.1% |
| 1409 | 1 | < 0.1% |
| 1404 | 1 | < 0.1% |
| 1401 | 2 | < 0.1% |
| 1395 | 1 | < 0.1% |
population
Real number (ℝ)
High correlation 
| Distinct | 3051 |
|---|---|
| Distinct (%) | 16.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1238.5804 |
| Minimum | 3 |
| Maximum | 3580 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 339 |
| Q1 | 770 |
| median | 1126 |
| Q3 | 1599 |
| 95-th percentile | 2563.2 |
| Maximum | 3580 |
| Range | 3577 |
| Interquartile range (IQR) | 829 |
Descriptive statistics
| Standard deviation | 663.3777 |
|---|---|
| Coefficient of variation (CV) | 0.5355952 |
| Kurtosis | 0.65268337 |
| Mean | 1238.5804 |
| Median Absolute Deviation (MAD) | 400 |
| Skewness | 0.85445874 |
| Sum | 23405453 |
| Variance | 440069.97 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 891 | 25 | 0.1% |
| 1052 | 24 | 0.1% |
| 1227 | 24 | 0.1% |
| 761 | 23 | 0.1% |
| 850 | 23 | 0.1% |
| 1005 | 22 | 0.1% |
| 782 | 22 | 0.1% |
| 872 | 21 | 0.1% |
| 999 | 21 | 0.1% |
| 753 | 21 | 0.1% |
| Other values (3041) | 18671 | 98.8% |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 14 | 3 | < 0.1% |
| 15 | 2 | < 0.1% |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3580 | 2 | < 0.1% |
| 3574 | 1 | < 0.1% |
| 3572 | 1 | < 0.1% |
| 3570 | 2 | < 0.1% |
| 3569 | 1 | < 0.1% |
| 3567 | 1 | < 0.1% |
| 3566 | 1 | < 0.1% |
| 3565 | 1 | < 0.1% |
| 3563 | 1 | < 0.1% |
| 3562 | 2 | < 0.1% |
households
Real number (ℝ)
High correlation 
| Distinct | 1137 |
|---|---|
| Distinct (%) | 6.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 433.16505 |
| Minimum | 2 |
| Maximum | 1157 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 123 |
| Q1 | 275 |
| median | 394 |
| Q3 | 560 |
| 95-th percentile | 876 |
| Maximum | 1157 |
| Range | 1155 |
| Interquartile range (IQR) | 285 |
Descriptive statistics
| Standard deviation | 224.58173 |
|---|---|
| Coefficient of variation (CV) | 0.51846688 |
| Kurtosis | 0.33979972 |
| Mean | 433.16505 |
| Median Absolute Deviation (MAD) | 136 |
| Skewness | 0.74943087 |
| Sum | 8185520 |
| Variance | 50436.955 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 306 | 57 | 0.3% |
| 386 | 55 | 0.3% |
| 429 | 54 | 0.3% |
| 335 | 54 | 0.3% |
| 282 | 54 | 0.3% |
| 284 | 51 | 0.3% |
| 297 | 51 | 0.3% |
| 362 | 50 | 0.3% |
| 340 | 50 | 0.3% |
| 330 | 49 | 0.3% |
| Other values (1127) | 18372 | 97.2% |
| Value | Count | Frequency (%) |
| 2 | 3 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 6 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 8 | < 0.1% |
| 10 | 6 | < 0.1% |
| 11 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1157 | 1 | < 0.1% |
| 1153 | 1 | < 0.1% |
| 1152 | 1 | < 0.1% |
| 1151 | 4 | < 0.1% |
| 1150 | 3 | < 0.1% |
| 1149 | 1 | < 0.1% |
| 1148 | 1 | < 0.1% |
| 1147 | 3 | < 0.1% |
| 1146 | 2 | < 0.1% |
| 1144 | 2 | < 0.1% |
median_income
Real number (ℝ)
High correlation 
| Distinct | 11715 |
|---|---|
| Distinct (%) | 62.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7332881 |
| Minimum | 0.4999 |
| Maximum | 9.6062 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 0.4999 |
|---|---|
| 5-th percentile | 1.57486 |
| Q1 | 2.5391 |
| median | 3.5 |
| Q3 | 4.6629 |
| 95-th percentile | 6.81096 |
| Maximum | 9.6062 |
| Range | 9.1063 |
| Interquartile range (IQR) | 2.1238 |
Descriptive statistics
| Standard deviation | 1.6139922 |
|---|---|
| Coefficient of variation (CV) | 0.43232456 |
| Kurtosis | 0.45394414 |
| Mean | 3.7332881 |
| Median Absolute Deviation (MAD) | 1.0458 |
| Skewness | 0.79035605 |
| Sum | 70547.946 |
| Variance | 2.6049707 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.125 | 48 | 0.3% |
| 4.125 | 44 | 0.2% |
| 2.625 | 44 | 0.2% |
| 2.875 | 44 | 0.2% |
| 3.875 | 40 | 0.2% |
| 4 | 36 | 0.2% |
| 3 | 36 | 0.2% |
| 3.375 | 36 | 0.2% |
| 3.625 | 36 | 0.2% |
| 4.375 | 33 | 0.2% |
| Other values (11705) | 18500 | 97.9% |
| Value | Count | Frequency (%) |
| 0.4999 | 12 | 0.1% |
| 0.536 | 10 | 0.1% |
| 0.5495 | 1 | < 0.1% |
| 0.6433 | 1 | < 0.1% |
| 0.6825 | 1 | < 0.1% |
| 0.6831 | 1 | < 0.1% |
| 0.696 | 1 | < 0.1% |
| 0.6991 | 1 | < 0.1% |
| 0.7007 | 1 | < 0.1% |
| 0.7025 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.6062 | 1 | < 0.1% |
| 9.6047 | 1 | < 0.1% |
| 9.6023 | 1 | < 0.1% |
| 9.5908 | 1 | < 0.1% |
| 9.5862 | 1 | < 0.1% |
| 9.5823 | 1 | < 0.1% |
| 9.5561 | 1 | < 0.1% |
| 9.5551 | 1 | < 0.1% |
| 9.532 | 1 | < 0.1% |
| 9.5271 | 1 | < 0.1% |
median_house_value
Real number (ℝ)
High correlation 
| Distinct | 3768 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201590.1 |
| Minimum | 14999 |
| Maximum | 500001 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 147.8 KiB |
Quantile statistics
| Minimum | 14999 |
|---|---|
| 5-th percentile | 65400 |
| Q1 | 116900 |
| median | 176300 |
| Q3 | 258300 |
| 95-th percentile | 446200 |
| Maximum | 500001 |
| Range | 485002 |
| Interquartile range (IQR) | 141400 |
Descriptive statistics
| Standard deviation | 111062.13 |
|---|---|
| Coefficient of variation (CV) | 0.55093047 |
| Kurtosis | 0.42947719 |
| Mean | 201590.1 |
| Median Absolute Deviation (MAD) | 67100 |
| Skewness | 0.97918288 |
| Sum | 3.809448 × 10^9 |
| Variance | 1.2334796 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500001 | 639 | 3.4% |
| 137500 | 116 | 0.6% |
| 162500 | 113 | 0.6% |
| 112500 | 97 | 0.5% |
| 187500 | 87 | 0.5% |
| 225000 | 84 | 0.4% |
| 87500 | 76 | 0.4% |
| 350000 | 72 | 0.4% |
| 150000 | 62 | 0.3% |
| 175000 | 62 | 0.3% |
| Other values (3758) | 17489 | 92.5% |
| Value | Count | Frequency (%) |
| 14999 | 4 | < 0.1% |
| 17500 | 1 | < 0.1% |
| 22500 | 3 | < 0.1% |
| 25000 | 1 | < 0.1% |
| 26600 | 1 | < 0.1% |
| 26900 | 1 | < 0.1% |
| 27500 | 1 | < 0.1% |
| 30000 | 2 | < 0.1% |
| 32500 | 4 | < 0.1% |
| 32900 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500001 | 639 | 3.4% |
| 500000 | 26 | 0.1% |
| 499100 | 1 | < 0.1% |
| 499000 | 1 | < 0.1% |
| 498800 | 1 | < 0.1% |
| 498700 | 1 | < 0.1% |
| 498400 | 1 | < 0.1% |
| 497600 | 1 | < 0.1% |
| 497400 | 1 | < 0.1% |
| 496400 | 2 | < 0.1% |
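In the table above, the maximum 500001 occurs 639 times while neighbouring values occur once or twice. That pattern is consistent with a top-coded (capped) variable, although the report does not say so. A minimal sketch of a heuristic check follows; `looks_top_coded` and its `ratio` threshold are hypothetical and chosen only for illustration.

```python
from statistics import median

def looks_top_coded(extreme_counts, ratio=10):
    """Heuristic: flag the maximum if it is far more frequent than the
    other largest values. `extreme_counts` maps value -> count and
    `ratio` is an arbitrary illustrative threshold."""
    top = max(extreme_counts)
    others = [c for v, c in extreme_counts.items() if v != top]
    return extreme_counts[top] >= ratio * median(others)

# Largest median_house_value entries, from the table above.
house_value_max = {500001: 639, 500000: 26, 499100: 1, 499000: 1,
                   498800: 1, 498700: 1, 498400: 1, 497600: 1,
                   497400: 1, 496400: 2}
# Largest total_rooms entries, for contrast (each occurs once).
total_rooms_max = {8874: 1, 8806: 1, 8803: 1, 8259: 1, 8254: 1}

print(looks_top_coded(house_value_max))
print(looks_top_coded(total_rooms_max))
```

The same check could be applied to housing_median_age, where the value 52 is both the maximum and the 95-th percentile.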
ocean_proximity
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.8 KiB |
| <1H OCEAN | 8278 |
|---|---|
| INLAND | 6067 |
| NEAR OCEAN | 2444 |
| NEAR BAY | 2103 |
| ISLAND | 5 |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.0540827 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NEAR BAY |
|---|---|
| 2nd row | NEAR BAY |
| 3rd row | NEAR BAY |
| 4th row | NEAR BAY |
| 5th row | NEAR BAY |
Common Values
| Value | Count | Frequency (%) |
| <1H OCEAN | 8278 | 43.8% |
| INLAND | 6067 | 32.1% |
| NEAR OCEAN | 2444 | 12.9% |
| NEAR BAY | 2103 | 11.1% |
| ISLAND | 5 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| ocean | 10722 | 33.8% |
| 1h | 8278 | 26.1% |
| inland | 6067 | 19.1% |
| near | 4547 | 14.3% |
| bay | 2103 | 6.6% |
| island | 5 | < 0.1% |
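The word counts in the table above can be reproduced from the category counts of ocean_proximity. The sketch below assumes a simple tokenization (lowercase the label, then keep runs of letters and digits). This is an assumption that happens to match the table, not necessarily how the profiler tokenizes.

```python
import re
from collections import Counter

# Category counts from the Common Values table of ocean_proximity.
counts = {"<1H OCEAN": 8278, "INLAND": 6067, "NEAR OCEAN": 2444,
          "NEAR BAY": 2103, "ISLAND": 5}

word_counts = Counter()
for label, n in counts.items():
    # Assumed tokenization: lowercase, keep alphanumeric runs.
    for word in re.findall(r"[a-z0-9]+", label.lower()):
        word_counts[word] += n

total_words = sum(word_counts.values())
for word, c in word_counts.most_common():
    print(f"{word:<7} {c:>6} {c / total_words:.1%}")
```

Note that the frequencies are relative to the total number of words, not to the number of rows.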
Most occurring characters
| Value | Count | Frequency (%) |
| N | 27408 | 18.0% |
| A | 23444 | 15.4% |
| E | 15269 | 10.0% |
| (space) | 12825 | 8.4% |
| O | 10722 | 7.0% |
| C | 10722 | 7.0% |
| < | 8278 | 5.4% |
| 1 | 8278 | 5.4% |
| H | 8278 | 5.4% |
| I | 6072 | 4.0% |
| Other values (6) | 20902 | 13.7% |
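The character counts above follow directly from the category counts, since every row contributes each character of its label. The sketch below recomputes them and, as a side check, derives the mean label length as total characters divided by the number of rows.

```python
from collections import Counter

# Category counts from the Common Values table of ocean_proximity.
counts = {"<1H OCEAN": 8278, "INLAND": 6067, "NEAR OCEAN": 2444,
          "NEAR BAY": 2103, "ISLAND": 5}

char_counts = Counter()
for label, n in counts.items():
    for ch in label:
        char_counts[ch] += n   # each row contributes every character once

total_chars = sum(char_counts.values())
mean_length = total_chars / sum(counts.values())

for ch, c in char_counts.most_common(4):
    print(repr(ch), c, f"{c / total_chars:.1%}")
print(total_chars, round(mean_length, 7))
```

The fourth most common character is the space, which is easy to overlook in the rendered tables.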
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 122817 | 80.7% |
| Space Separator | 12825 | 8.4% |
| Math Symbol | 8278 | 5.4% |
| Decimal Number | 8278 | 5.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 27408 | 22.3% |
| A | 23444 | 19.1% |
| E | 15269 | 12.4% |
| O | 10722 | 8.7% |
| C | 10722 | 8.7% |
| H | 8278 | 6.7% |
| I | 6072 | 4.9% |
| L | 6072 | 4.9% |
| D | 6072 | 4.9% |
| R | 4547 | 3.7% |
| Other values (3) | 4211 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12825 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 8278 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8278 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 122817 | 80.7% |
| Common | 29381 | 19.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 27408 | 22.3% |
| A | 23444 | 19.1% |
| E | 15269 | 12.4% |
| O | 10722 | 8.7% |
| C | 10722 | 8.7% |
| H | 8278 | 6.7% |
| I | 6072 | 4.9% |
| L | 6072 | 4.9% |
| D | 6072 | 4.9% |
| R | 4547 | 3.7% |
| Other values (3) | 4211 | 3.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 12825 | 43.7% |
| < | 8278 | 28.2% |
| 1 | 8278 | 28.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 152198 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 27408 | 18.0% |
| A | 23444 | 15.4% |
| E | 15269 | 10.0% |
| (space) | 12825 | 8.4% |
| O | 10722 | 7.0% |
| C | 10722 | 7.0% |
| < | 8278 | 5.4% |
| 1 | 8278 | 5.4% |
| H | 8278 | 5.4% |
| I | 6072 | 4.0% |
| Other values (6) | 20902 | 13.7% |
Interactions
Correlations
| | households | housing_median_age | latitude | longitude | median_house_value | median_income | ocean_proximity | population | total_bedrooms | total_rooms |
|---|---|---|---|---|---|---|---|---|---|---|
| households | 1.000 | -0.220 | -0.067 | 0.043 | 0.123 | 0.036 | 0.064 | 0.887 | 0.972 | 0.895 |
| housing_median_age | -0.220 | 1.000 | 0.023 | -0.142 | 0.086 | -0.141 | 0.196 | -0.223 | -0.248 | -0.303 |
| latitude | -0.067 | 0.023 | 1.000 | -0.880 | -0.174 | -0.092 | 0.475 | -0.125 | -0.046 | -0.003 |
| longitude | 0.043 | -0.142 | -0.880 | 1.000 | -0.059 | -0.003 | 0.430 | 0.116 | 0.044 | 0.020 |
| median_house_value | 0.123 | 0.086 | -0.174 | -0.059 | 1.000 | 0.668 | 0.303 | -0.001 | 0.095 | 0.210 |
| median_income | 0.036 | -0.141 | -0.092 | -0.003 | 0.668 | 1.000 | 0.129 | 0.007 | -0.003 | 0.288 |
| ocean_proximity | 0.064 | 0.196 | 0.475 | 0.430 | 0.303 | 0.129 | 1.000 | 0.077 | 0.047 | 0.031 |
| population | 0.887 | -0.223 | -0.125 | 0.116 | -0.001 | 0.007 | 0.077 | 1.000 | 0.849 | 0.789 |
| total_bedrooms | 0.972 | -0.248 | -0.046 | 0.044 | 0.095 | -0.003 | 0.047 | 0.849 | 1.000 | 0.904 |
| total_rooms | 0.895 | -0.303 | -0.003 | 0.020 | 0.210 | 0.288 | 0.031 | 0.789 | 0.904 | 1.000 |
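The "High correlation" alerts near the top of the report can be related to this matrix by listing every pair whose absolute coefficient reaches a threshold. The sketch below uses 0.5, an assumption chosen because it reproduces the flagged variables; the threshold actually configured for this run is not shown in the report.

```python
from itertools import combinations

names = ["households", "housing_median_age", "latitude", "longitude",
         "median_house_value", "median_income", "ocean_proximity",
         "population", "total_bedrooms", "total_rooms"]

# Coefficients copied row by row from the correlation table above.
matrix = [
    [1.000, -0.220, -0.067, 0.043, 0.123, 0.036, 0.064, 0.887, 0.972, 0.895],
    [-0.220, 1.000, 0.023, -0.142, 0.086, -0.141, 0.196, -0.223, -0.248, -0.303],
    [-0.067, 0.023, 1.000, -0.880, -0.174, -0.092, 0.475, -0.125, -0.046, -0.003],
    [0.043, -0.142, -0.880, 1.000, -0.059, -0.003, 0.430, 0.116, 0.044, 0.020],
    [0.123, 0.086, -0.174, -0.059, 1.000, 0.668, 0.303, -0.001, 0.095, 0.210],
    [0.036, -0.141, -0.092, -0.003, 0.668, 1.000, 0.129, 0.007, -0.003, 0.288],
    [0.064, 0.196, 0.475, 0.430, 0.303, 0.129, 1.000, 0.077, 0.047, 0.031],
    [0.887, -0.223, -0.125, 0.116, -0.001, 0.007, 0.077, 1.000, 0.849, 0.789],
    [0.972, -0.248, -0.046, 0.044, 0.095, -0.003, 0.047, 0.849, 1.000, 0.904],
    [0.895, -0.303, -0.003, 0.020, 0.210, 0.288, 0.031, 0.789, 0.904, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report

# Visit each unordered pair once (upper triangle of the matrix).
flagged = [(a, b, matrix[i][j])
           for (i, a), (j, b) in combinations(enumerate(names), 2)
           if abs(matrix[i][j]) >= THRESHOLD]

for a, b, r in sorted(flagged, key=lambda t: -abs(t[2])):
    print(f"{a} ~ {b}: {r:+.3f}")

flagged_vars = {v for a, b, _ in flagged for v in (a, b)}
```

With this cut-off, housing_median_age and ocean_proximity are the only variables left unflagged, which matches the alerts list.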
Missing values
Sample
| | longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | median_house_value | ocean_proximity |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -122.23 | 37.88 | 41.0 | 880.0 | 129.0 | 322.0 | 126.0 | 8.3252 | 452600.0 | NEAR BAY |
| 1 | -122.22 | 37.86 | 21.0 | 7099.0 | 1106.0 | 2401.0 | 1138.0 | 8.3014 | 358500.0 | NEAR BAY |
| 2 | -122.24 | 37.85 | 52.0 | 1467.0 | 190.0 | 496.0 | 177.0 | 7.2574 | 352100.0 | NEAR BAY |
| 3 | -122.25 | 37.85 | 52.0 | 1274.0 | 235.0 | 558.0 | 219.0 | 5.6431 | 341300.0 | NEAR BAY |
| 4 | -122.25 | 37.85 | 52.0 | 1627.0 | 280.0 | 565.0 | 259.0 | 3.8462 | 342200.0 | NEAR BAY |
| 5 | -122.25 | 37.85 | 52.0 | 919.0 | 213.0 | 413.0 | 193.0 | 4.0368 | 269700.0 | NEAR BAY |
| 6 | -122.25 | 37.84 | 52.0 | 2535.0 | 489.0 | 1094.0 | 514.0 | 3.6591 | 299200.0 | NEAR BAY |
| 7 | -122.25 | 37.84 | 52.0 | 3104.0 | 687.0 | 1157.0 | 647.0 | 3.1200 | 241400.0 | NEAR BAY |
| 8 | -122.26 | 37.84 | 42.0 | 2555.0 | 665.0 | 1206.0 | 595.0 | 2.0804 | 226700.0 | NEAR BAY |
| 9 | -122.25 | 37.84 | 52.0 | 3549.0 | 707.0 | 1551.0 | 714.0 | 3.6912 | 261100.0 | NEAR BAY |
| | longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | median_house_value | ocean_proximity |
|---|---|---|---|---|---|---|---|---|---|---|
| 18887 | -121.32 | 39.29 | 11.0 | 2640.0 | 505.0 | 1257.0 | 445.0 | 3.5673 | 112000.0 | INLAND |
| 18888 | -121.40 | 39.33 | 15.0 | 2655.0 | 493.0 | 1200.0 | 432.0 | 3.5179 | 107200.0 | INLAND |
| 18889 | -121.45 | 39.26 | 15.0 | 2319.0 | 416.0 | 1047.0 | 385.0 | 3.1250 | 115600.0 | INLAND |
| 18890 | -121.53 | 39.19 | 27.0 | 2080.0 | 412.0 | 1082.0 | 382.0 | 2.5495 | 98300.0 | INLAND |
| 18891 | -121.56 | 39.27 | 28.0 | 2332.0 | 395.0 | 1041.0 | 344.0 | 3.7125 | 116800.0 | INLAND |
| 18892 | -121.09 | 39.48 | 25.0 | 1665.0 | 374.0 | 845.0 | 330.0 | 1.5603 | 78100.0 | INLAND |
| 18893 | -121.21 | 39.49 | 18.0 | 697.0 | 150.0 | 356.0 | 114.0 | 2.5568 | 77100.0 | INLAND |
| 18894 | -121.22 | 39.43 | 17.0 | 2254.0 | 485.0 | 1007.0 | 433.0 | 1.7000 | 92300.0 | INLAND |
| 18895 | -121.32 | 39.43 | 18.0 | 1860.0 | 409.0 | 741.0 | 349.0 | 1.8672 | 84700.0 | INLAND |
| 18896 | -121.24 | 39.37 | 16.0 | 2785.0 | 616.0 | 1387.0 | 530.0 | 2.3886 | 89400.0 | INLAND |